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Abstract — We define the AWGNC, BSC, and max-fractional 
pseudocodeword redundancy p(C) of a code C as the smailest 
number of rows in a parity-check matrix such that the cor- 
responding minimum pseudoweight is equal to the minimum 
Hamming distance of C. We show that most codes do not have 
a finite p(C). We also provide bounds on the pseudocodeword 
redundancy for some families of codes, including codes based on 
designs. 

I. Introduction 

Pseudocodewords represent the intrinsic mechanism of fail- 
ure of binary linear codes under linear-programming (LP) or 
message-passing (MP) decoding. In JT|, the pseudocodeword 
effective Euclidean weight, or pseudoweight, was associated 
with any pseudocodeword. This concept of pseudoweight was 
shown to play an analogous role to that of the signal Euclidean 
distance (AWGNC) or Hamming distance (BSC) in the ML 
decoding scenario. The minimum pseudoweight of the code 
C with respect to a parity-check matrix H is defined as 
the minimum over all pseudoweights of nonzero pseudocode- 
words; this may be considered as a first-order measure of 
decoder error-correcting performance for LP or MP decoding. 
Typically, a lower minimum pseudoweight corresponds to a 
higher probability of decoding error. Another measure closely 
related to BSC pseudoweight is the max-fractional weight 
(pseudoweight). It serves as a lower bound on both AWGNC 
and BSC pseudoweights. 

In order to minimise the decoding error probability under 
LP (or MP) decoding, one might want to select a matrix 
H which maximises the minimum pseudoweight of the code 
for the given channel. However, generally it is not clear 
how this goal may be achieved. Adding redundant rows to 
the parity-check matrix introduces additional constraints on 
the so-called fundamental cone, and may thus increase the 
minimum pseudoweight. However, such additions increase the 
decoding complexity under MP decoding, especially since 
linear combinations of low-density rows may not yield a 
low-density result. On the other hand, there exist classes of 
codes for which sparse parity-check matrices exist with many 
redundant rows, e.g. Q. 

For the AWGNC, BEC, BSC pseudoweights, and max- 
fractional weight, define p AWG Nc(C), p BEC (C), p BS c(C), and 
Pmax fr a c(C), respectively, to be the minimum number of rows 
in any parity-check matrix H such that the minimum pseu- 
doweight of C with respect to this matrix is equal to the 



code's minimum distance D. For the sake of simplicity, we 
sometimes use the notation p(C) when the type of channel is 
clear from the context. The value p(C) is called the (AWGNC, 
BEC, BSC, max-fractional) pseudocodeword redundancy (or 
pseudoredundancy) of C. If for the code C there exists no such 
matrix H, we say that the pseudoredundancy is infinite. 

The BEC pseudocodeword redundancy was studied in Q, 
where it was shown that for any binary linear code C there 
exists a parity-check matrix H such that the minimum pseu- 
doweight with respect to this H is equal to D, and therefore 
the BEC pseudocodeword redundancy is finite for all codes. 
The authors also presented some bounds on p B Ec(C) for general 
linear codes, and for some specific families of codes. 

In this work, we address the analogous problem for the 
AWGNC, BSC, and max-fractional pseudoweight. We show 
that for most codes there exists no H such that the minimum 
pseudoweight (with respect to H) is equal to D, and there- 
fore the AWGNC, BSC, and max-fractional pseudocodeword 
redundancy (as defined above) is infinite for most codes. For 
some code families for which the pseudoredundancy is finite, 
we provide upper bounds on its value. 

II. General Settings 

Let C be a code of length n £ N over the binary field F2, 
defined by 

C = ker H = {c e F' 2 l : Hc T = T } (1) 

where H is an m x n parity-check matrix of the code C. 
Obviously, the code C may admit more than one parity-check 
matrix, and all the codewords form a linear vector space of 
dimension k > n — m. We say that k is the dimension of the 
code C. The rate of the code C is defined as R(C) = k/n and 
is equal to the number of information bits per coded bit. We 
denote by D the minimum Hamming distance (also called the 
minimum distance) of C. The code C may then be referred to 
as an [n, k, D] linear code over F2. 

Denote the set of column indices and the set of row 
indices of H by X = {1, 2, . . . , n} and J = {1,2,..., m}, 
respectively. For j e J , we denote X,- = {i El : Hjj ^ 0}, 
and for i <E X, we denote J, = {j E J : H jti + 0}. The 
fundamental cone of H, denoted JC(H), is defined as the set 
of vectors x £ R n that satisfy 

Vj £j,We Xj : x e < x i > ( 2 ) 



\fi £ I : Xi > 



(3) 



The vectors x G R™ satisfying (f2| and (f3) are called 
pseudocodewords of C with respect to the parity-check matrix 
H. Note that the fundamental cone K{H) depends on the 
parity-check matrix H rather than on the code C itself. At 
the same time, the fundamental cone is independent of the 
underlying communication channel. 

The BEC, AWGNC, BSC pseudoweights, and max- 
fractional weight of a nonzero pseudocodeword x E fc(H) 
were defined in [lj and |4| as follows: 

W BEC (£C) = |supp(x)| , 



Wv 



:(*) = 



(E 



Let x' be a vector in R" with the same components as x but 
in non-increasing order. For i — 1 < £ < i, where 1 < i < n, 
let 0(£) = x\. Define $(£) 4 j* 0(f) d£' and 

w BSC ( : r) =2$- 1 ($(n)/2) . 
Finally, the max-fractional weight of x is defined as 

w maxrrac (a;) = 2 . 

maxjgi x l 

We define the BEC minimum pseudoweight of the code C 
with respect to the parity-check matrix H as 



min w BEC (cc) . 

xeK.(H)\{0} 



The quantities w™* c (H), w™ n (if) and w^Jff) are de- 
fined similarly. Note that all four minimum pseudoweights are 
upper bounded by D, the code's minimum distance. 

Then we define the BEC pseudocodeword redundancy of 
the code C as 



p BEC (C) = inf{#rows(iO | ker H = C, w™ a (H) = D} , 

where inf = oo, and similarly we define the pseudocode- 
word redundancies /0 A wgnc(C), p ssc (C) and jCW^C) for the 
AWGNC and BSC pseudoweights, and the max-fractional 
weight. We remark that all pseudocodeword redundancies 
satisfy p(C) > n — k. 

III. Basic Connections 

The next lemma is taken from J4). 

Lemma 3.1: Let C be a binary linear code with the parity- 
check matrix H. Then, 

wrLW < ws&c(#) < w B r(H) , 
»cl(») < w B r(/f) < w^(jf) . 

The following theorem is a straightforward corollary to 
Lemma 13.11 

Theorem 3.2: Let C be a binary linear code. Then, 

(C) > Fawcnc (C) > Pbec(C) , 

(C) > /? BSC (C) > p BEC (C) . 

We note that for geometrically perfect codes, a class of 
codes defined and characterised in 0, all four pseudocode- 
word redundancies are finite. 



IV. PSEUDOREDUNDANCY OF RANDOM CODES 

We begin with the following lemma. 

Lemma 4.1: For the binary linear code C of length n, 
let d be the minimum distance of the dual code. Then, the 
minimum AWGNC pseudoweight of C (with respect to any 
parity-check matrix H) satisfies 



mm < 

AWGNC — 



(n + d- 2) 2 
(d-l) 2 + (n-l) 



(4) 



Proof: Consider the pseudocodeword x = 
{x\,X2, ■ • ■ , x n ) = (d — 1, 1, . . . , 1). Since d is the minimum 
distance of the dual code, every row in H has weight at 
least d. Therefore, all inequalities (fJI and (O are satisfied for 
this x, and so it is indeed a legal pseudocodeword. Finally, 
observe that the AWGNC pseudoweight of x is given by the 
right-hand side of @. ■ 
Next, we take a random binary linear code C of a fixed rate 
R and arbitrary length n (for n — > oo). It is well known that 
the relative minimum distance 6 — D/n of C attains, with 
probability approaching 1 as n — > oo, for any fixed small 
e > 0, the Gilbert- Varshamov bound 

S> H^(l-R)-€, 

where H 2 ^ 1 (-) is the inverse of the binary entropy function 
H2(p) = —p^og 2 p — (1 — p)log 2 (l — p) (see [6, Theorems 
4.4, 4.5, and 4.10] for details). 

The dual code C of C can be viewed as a random code 
also, and so with high probability the rate R = 1 — R and the 
relative minimum distance S 1 - = d/n of the dual code attain 
the Gilbert- Varshamov bound 

S ± > it = Ha x (l -R ± )-e = H^(R) - e , 

Note that dU may be written in terms of the relative 
minimum distance S 1 - of the dual code as follows: 



< 



(1 + 5 1 - -2/nf 



(5- 1 -l/n) 2 + (l/n-l/n 2 ) 



(5) 



For large n, the minimum pseudoweight of the code C ± 
is bounded from above by the constant (1 + + e' < 

(1 + 1 / fi) 2 + e' for some small e' > — this bound does not 
depend on n. On the other hand, C is a random code and so 
its minimum distance satisfies the Gilbert- Varshamov bound, 
namely 

D> (H 2 - 1 (l-i?)-e) - n, 

which increases linearly with n for a fixed R. 

We obtain that for any H, there is a gap between the min- 
imum pseudoweight and the minimum distance of a random 
code C. Therefore, we have the following corollary. 

Corollary 4.2: Let < R < 1 be fixed. For n large 
enough, for a random binary linear code C of length n 
and rate R, there is a gap between the minimum AWGNC 
pseudoweight (with respect to any parity-check matrix) and the 
minimum distance. Therefore, the AWGNC pseudoredundancy 
is infinite for most codes. 
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The following lemma is a counterpart of Lemma |4T| for the 
BSC. 

Lemma 4.3: Let C be a binary linear code of length n, 
and let d be the minimum distance of the dual code. Then, 
the minimum BSC pseudoweight of C (with respect to any 
parity-check matrix H) satisfies 



w™<2\n/d] 



Proof: Consider the pseudocodeword 
x = (xi,x 2 , . . . ,x n ) = (d- l,d— 1, . . . ,d— 1, 1, 1, . . . , 1) 



for some positive integer r. This x is then a legal pseudocode- 
word; since d is the minimum distance of the dual code, every 
row in H has a weight of at least d, and so, all inequalities (O 
and (f3]l are satisfied by this x. 

If r(d — 1) > n — t then by the definition of the BSC 
pseudoweight w BSC (:c) < 2t. This condition is equivalent to 
rd > n. Therefore, we set t = \n/d~\. For the corresponding 
x, the pseudoweight is less or equal to 2r = 2\n/d~\. ■ 

Similarly to the AWGNC case, let C be a random binary 
code of length n and a fixed rate R. The parameters R 1 - and 8 1 - 
of its dual code C x attain with high probability the Gilbert- 
Varshamov bound 8 > /i. 

From Lemma 14.31 for all n, the pseudoweight of the 
code C x is bounded from above by 

2\n/d] < 2/S ± +2< 2/fi + 2, 

which is a constant. On the other hand, C is a random code 
and its minimum distance also satisfies the Gilbert- Varshamov 
bound, so it increases linearly with n. It follows that for any 
H, there is a gap between the minimum BSC pseudoweight 
and the minimum distance of a random code C. 

Corollary 4.4: Let < R < 1 be fixed. For n large 
enough, for a random binary linear code C of length n and 
rate R, there is a gap between the minimum BSC pseu- 
doweight (with respect to any parity-check matrix) and the 
minimum distance. Therefore, the BSC pseudoredundancy is 
infinite for most codes. 

The last corollary disproves the conjecture in J7) that the 
BSC pseudoredundancy is finite for all binary linear codes. 

Example 4.1: Consider the [23,12] Golay code having 
minimum distance D = 7. The minimum distance of its dual 
code is d = 8. We can take a pseudocodeword x as in the 
proof of Lemma l4~3l with r = 3. We have w BSC (£c) < 2r = 6, 
thus obtaining that the minimum distance is not equal to the 
minimum pseudoweight. 

Similarly, for the [24,12] extended Golay code we have D = 
d = 8, and by taking r = 3 we obtain w BSC (cc) < 2r = 6. 

Note however that the presented techniques do not answer 
the question of whether these Golay codes have finite AWGNC 
pseudoredundancy. 



V. Codes Based on Designs 

Definition 5.1: A partial (w C) A) design is a block design 
consisting of an rt-element set V (whose elements are called 
points) and a collection of m subsets of V (called blocks) such 
that every point is contained in exactly w c blocks and every 
2-element subset of V is contained in at most A blocks. The 
incidence matrix of a design is an m x n matrix H whose 
rows correspond to the blocks and whose columns correspond 
to the points, and satisfies Hjj — 1 if block j contains point i, 
and Hj i = otherwise. 

If each block contains the same number w r of points and 
every 2-element subset of V is contained in exactly A blocks, 
the design is said to be an (n, w r ,X) balanced incomplete 
block design (BIBD). 

Note that for a BIBD we have nw r — mw r and also 



w c (w r - 1) = A(n - 1) , 



(6) 



so all other parameters may be deduced from (n, w r ,X); in 
particular, w c = " _■■ A. Note that [9] and [10] consider 
parity-check matrices based on BIBDs; these matrices are the 
transpose of the incidence matrices defined here. 

We have the following general result for codes based on 
partial (w c ,X) designs. 

Theorem 5.1: Let C be a code with parity-check matrix 
H, such that a subset of the rows of H forms the incidence 
matrix for a partial (w c , A) design. Then the minimum max- 
fractional weight of C with respect to H is lower bounded 
by 

(7) 



w c 

^ 1 + T 



For the case of an (n,w r ,X) BIBD, the lower bound in (0 
may also be written as 1 + n ~\ ', the alternative form follows 
trivially from ([SJ. 

Proof: Consider the subset of the rows of H which forms 
the incidence matrix for a partial (w c , A) design. Let x be a 
nonzero pseudocodeword and let xg be a maximal coordinate 
of x (I £ T). For all j & J such that I sum inequalities 
©. We have 

w c ■ x e < X ■ 2J X % i 

iez\{(} 

or 

\ + ^) Xi <^2xi. (8) 



.mm 



The result now easily follows from the definition of w" 

Theorem 5.2: Let C be a code with parity-check matrix 
H, such that a subset of the rows of H forms the incidence 
matrix for a partial (w c , A) design. Then, 

w min > 1 "'" 

AWGNC — 



W 



A 

w c 

> 1 + 



The proof follows from Lemma 13.11 and Theorem 15.11 

Another tool for proving lower bounds of the mini- 
mum AWGNC pseudoweight is provided by the following 
eigenvalue-based bound from ifTTI . 
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Proposition 5.3: The minimum AWGNC pseudoweight 
for a (w c , uv)-regular parity-check matrix H whose corre- 
sponding Tanner graph is connected is bounded below by 

W AWGNC > n , (9) 

Ml — 

where /ii and n% denote the largest and second largest eigen- 
value (respectively) of the matrix L = H T H, considered as 
a matrix over the real numbers. 

In the case where H is equal to the incidence matrix for 
an (n,w r ,X) BIBD, it is easy to check that the bound of 
Proposition 15.31 becomes 



w; 



W c 



We conclude that in this case the bound of Proposition | 
coincides with that of Theorem 15.21 (for the case of the 
AWGNC only). 

Note that the pseudoweight bounds of [ 12] for the EG(2, q) 
and PG(2, q) codes for q — 2 s > 2 follow from Theorem 15.21 
We next apply the bounds of Theorems 15.11 and 15.21 to some 
other examples of codes derived from designs. 

Proposition 5.4: For to > 2, the [2 m - 1, 2 m - 1 - m, 3] 
Hamming code has AWGNC, BSC, and max-fractional pseu- 
docodeword redundancies p(C) < 2 m — 1. 

Proof: For to > 2, consider the binary parity-check 
matrix H whose rows are exactly the nonzero codewords 
of the dual code C x , in this case the [2™ - l,m, 2 m_1 ] 
simplex code. This H is the incidence matrix for a BIBD with 
parameters (n,w r ,X) = (2 m - 1, 2 TO_1 , 2 m ~ 2 ). Theorem O 
gives w mat ^ m (x) > 3, leading to p m , x . fiac (C) < 2 m — 1. The result 
for AWGNC and BSC follows by applying Theorem E2 ■ 

In the next example, we consider simplex codes. Straight- 
forward application of the previous reasoning does not lead 
to the desired result. However, more careful selection of the 
matrix H, as described below, leads to a new bound on the 
pseudoredundancy. 

Proposition 5.5: For m > 2, the [2 m - l,m,2 m - 1 ] 
simplex code has AWGNC, BSC, and max-fractional pseu- 
docodeword redundancies 



(2 m - l)(2' r 



1) 



P(C)< 



Proof: For m > 2, consider the binary parity-check 
matrix H whose rows are exactly the codewords of the 
dual code C x (in this case the [2 m - l,2 m - 1 - to, 3] 
Hamming code) with Hamming weight equal to 3. This H 
is the incidence matrix for a BIBD with parameters (n, w r , A) 
= (2 m - 1,3,1). Theorem O gives w^X > 2 m ~ x . 

Note that the number of codewords of weight 3 in the [2 m — 
l,2 m - 1 - to, 3] Hamming code is (2 m - 1)(2™- 1 - l)/3. 
This is due to the fact that there exists a 3 : 1 mapping from 
all vectors of length 2™ — 1 and weight 2 onto the codewords 
of weight 3. 

Next, we justify the claim that H is the parity-check 
matrix of C. A theorem of Simonis [13] states that if there 



exists a linear [n,k,D] code then there also exists a linear 
[n,k, D] code whose codewords are spanned by the codewords 
of weight D. Since the Hamming code is unique for the 
parameters [2 m — l,2 m — 1 — m, 3], this implies that the 
Hamming code itself is spanned by the codewords of weight 
3, so the rowspace of H equals C. 

The result for AWGNC and BSC follows again by applying 
Theorem 13.21 ■ 

We remark that the bounds of Propositions 15.41 and 15.51 are 
sharp at least for the case to = 3 and the max-fractional 
weight, see Section [VI-BI 

The following proposition proves that the AWGNC, BSC, 
and max-fractional pseudocodeword redundancies are finite for 
all codes C with minimum distance at most 3. 

Proposition 5.6: Let C be a [n,k, D] code with D < 3. 
Then p max _ rrat (C) is finite. Moreover, we have p„ r .^, I!lc (C) = n — k 
in the case D < 2. 

Proof: First assume that D < 2. Let H be any parity- 
check matrix for the code C, let x be a nonzero pseudocode- 
word, and assume that xg is a maximal entry in x (for 
some i E I). We always have Y^iex Xi — Xi ancl nence 
w m „_ fra( .(a;) > 1. 

Therefore, we may assume D = 2. Note that for such a 
code, H has no zero column and thus we may write by (O 



i£Z\{t} 



2x, 



From the definition of max-fractional weight, we obtain that 
w max , frac (a;) > 2. Choosing a parity-check matrix for C with 
n — k rows we have that p max _ frac (C) — n — k. From Theorem l3.2l 
Pawgnc(C) = n - k and p BSC (C) = n - k. 

Next, consider a code with minimum distance D = 3. 
Denote by H the parity-check matrix whose rows consist of 
all codewords of the dual code of C. Note that for a code of 
minimum distance D, a parity-check matrix H consisting of 
all rows of the dual code C x is an orthogonal array of strength 
D — 1. In the present case D = 3, and this implies that in any 
pair of columns of if, all length-2 binary vectors occur with 
equal multiplicities (c.f. lfl4l p. 139]). Thus the matrix H is 
an incidence matrix for a partial block design with parameters 
(w c , A) = (2 r_1 , 2 r ~ 2 ), where r = n — k. Therefore for this 
matrix H the code has minimum (AWGNC, BSC, or max- 
fractional) pseudoweight at least 1 + w c /X = 3, and it follows 
that the pseudocodeword redundancy is finite for any code 
with D = 3. ■ 

VI. Some Experimental Results 

A. Cyclic Codes Meeting the Eigenvalue Bound 

We consider cyclic codes of length n with full circulant 
parity-check matrix H. Thus H = (Hj^ij^x is a square 
matrix with entries Hj : i — Cj-i for some vector c of length n, 
where all the indices are modulo n. This n x n matrix is then 
iu-regular (i.e. (iu, iu)-regular), where w — J2iex c i' so we 
may use the eigenvalue-based lower bound in Proposition 
to examine the AWGNC pseudocodeword redundancy. 
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TABLE I 

Cyclic codes up to length 250 with D > 3 meeting the 

EIGENVALUE BOUND 
parameters M>-regular comments 



4, namely 



[n, 1, n] 
[n, n — m, 3] 
[7,3.4] 
[15,7,5] 
[21,11,6] 
[63, 37, 9] 
[73,45, 10] 



2 

2 m-l 

3 
4 
5 



repetition code, n = 3. . . 250 
Hamming code, n = 2 m — 1, m = 
dual of the [7, 4, 3] Hamming code 
Euclidean geometry code EG(2,4) 
projective geometry code PG(2,4) 
Euclidean geometry code EG(2,8) 
projective geometry code PG(2,8) 



For the largest eigenvalue of the matrix L = H H we have 



2 , since every row weight of L equals 



Mi : 

w 2 . Consequently, the eigenvalue bound is 



> n- 



2w 



(J>2 



itr - fi 2 

where p,2 is the second largest eigenvalue of L. 

We carried out an exhaustive search on all cyclic codes C 
up to length n < 250 and computed the eigenvalue bound in 
all cases where the Tanner graph of the full circulant parity- 
check matrix is connected^ Table U gives a complete list of 
all cases in which the eigenvalue bound equals the minimum 
Hamming distance D and D > 3. In particular, the AWGNC 
pseudoweight equals the minimum Hamming distance in these 
cases as well and thus we have for the pseudocodeword 
redundancy Pawgnc(C) < n. 

B. The Pseudocodeword Redundancy for Codes of Small 
Length 

Let C be a binary linear code with parameters [n,k,D] 
and let r = n — k. Two parity-check matrices H and H' 
of C are called equivalent if H can be transformed into H' 
by a sequence of row and column permutations. In this case, 
w min (H') = w min (H'). 

We computed the AWGNC, BSC, and max-fractional pseu- 
docodeword redundancies for all codes up to length 9. Note 
that for this also all possible parity-check matrices (up to 
equivalence) had to be examined]! The main observations are 
the following: 

• If D > 3 then for every parity-check matrix H we have 
wZZc( H ) > 3- This is not true for the BSC. 

. If k = 2, then p AW0KC (C) = p BSC (C) = /Wta^C) = r. 

• For the [7,4,3] Hamming code C we have /3 A wgnc(C) = 
r = 3, p BSC (C) = 4, and p m ^{C) = 7. 

• For the [7,3,4] simplex code C we have p AV/GKC {C) — 
r = 4, p BSC (C) — 5, and /o milx _ rnit (C) = 7. There is (up 
to equivalence) only one parity-check matrix H with 



'This computation was done by a self- written C program. 

2 The enumeration of codes and parity-check matrices was done by self- 
written C programs. The minimum pseudoweight for the various parity-check 
matrices was computed by using Maple 12 and the Convex package by 
Matthias Franz, available at http://www-fourier.ujf-grenoble.fr/~franz/convex/ 



H 



1 1 

1 















It is the only parity-check matrix with constant row 
weight 3. 

• For the [8, 4, 4] extended Hamming code C we have 

Pawgnc(C) = 5, Pbsc(C) = 6, and p mia4ac (C) = 00. This 
code C is the shortest one such that p AW cNc (C) > r, and 
also the shortest one such that p m .^ hx (C) = 00. 
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